Dengue disease prediction using WEKA data mining tool

Authors

  • Kashish Ara Shakil
  • Shadma Anis
  • Mansaf Alam
Abstract

Dengue is a life-threatening disease prevalent in several developed and developing countries, including India. It is a viral disease spread by Aedes mosquitoes. Available dengue datasets describe patients with and without the disease, along with attributes such as fever temperature, WBC, platelets, severe headache, vomiting, metallic taste, joint pain, appetite, diarrhea, hematocrit, hemoglobin, and the number of days of illness in different cities. In this paper we discuss data mining algorithms that have been used for dengue disease prediction. Data mining is a well-known technique used by health organizations and in bioinformatics research to classify diseases such as dengue, diabetes, and cancer. In the proposed approach we use WEKA with 10-fold cross-validation to evaluate the data and compare results. WEKA provides an extensive collection of machine learning and data mining algorithms. We first classify the dengue dataset and then compare different data mining techniques in WEKA through its Explorer, Knowledge Flow, and Experimenter interfaces. To validate the approach, we use a dengue dataset with 108 instances, of which WEKA used 99 rows and 18 attributes, to measure the prediction accuracy of different classification algorithms and find the best performer. The main objective of this paper is to classify the data, help users extract useful information from it, and make it easy to identify a suitable algorithm for an accurate predictive model.
From the findings of this paper it can be concluded that Naïve Bayes and J48 perform best in terms of classification accuracy: they achieved the maximum accuracy of 100% with 99 correctly classified instances, the maximum ROC value of 1, the lowest mean absolute error, and the shortest model-building time in the Explorer and Knowledge Flow results.
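
The evaluation procedure described in the abstract (train a classifier, score it with 10-fold cross-validation, report accuracy and an absolute-error figure) can be sketched in plain Python. This is only an illustration under stated assumptions, not the authors' WEKA setup: the toy rows and their attribute values are invented for the example, the classifier is a simple categorical Naive Bayes with add-one smoothing, and the error figure is taken here as one minus the probability assigned to the true class, which may differ from how WEKA computes its mean absolute error.

```python
import math
import random
from collections import Counter, defaultdict


def train_naive_bayes(rows, labels):
    """Fit a categorical Naive Bayes model (counts only; smoothing is applied at prediction)."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)   # (feature index, label) -> value -> count
    feature_values = defaultdict(set)     # feature index -> distinct values seen
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            value_counts[(i, label)][value] += 1
            feature_values[i].add(value)
    return class_counts, value_counts, feature_values


def predict_proba(model, row):
    """Return a dict of class -> posterior probability, using add-one smoothing."""
    class_counts, value_counts, feature_values = model
    total = sum(class_counts.values())
    log_scores = {}
    for label, count in class_counts.items():
        score = math.log(count / total)
        for i, value in enumerate(row):
            numerator = value_counts[(i, label)][value] + 1
            denominator = count + len(feature_values[i])
            score += math.log(numerator / denominator)
        log_scores[label] = score
    peak = max(log_scores.values())       # subtract the max for numerical stability
    exp_scores = {k: math.exp(v - peak) for k, v in log_scores.items()}
    norm = sum(exp_scores.values())
    return {k: v / norm for k, v in exp_scores.items()}


def cross_validate(rows, labels, k=10, seed=0):
    """k-fold cross-validation; returns (accuracy, mean absolute error)."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]   # k disjoint folds covering every row
    correct, abs_error = 0, 0.0
    for fold in folds:
        held_out = set(fold)
        train_idx = [i for i in indices if i not in held_out]
        model = train_naive_bayes([rows[i] for i in train_idx],
                                  [labels[i] for i in train_idx])
        for i in fold:
            proba = predict_proba(model, rows[i])
            predicted = max(proba, key=proba.get)
            correct += predicted == labels[i]
            abs_error += 1.0 - proba.get(labels[i], 0.0)
    return correct / len(rows), abs_error / len(rows)


def make_toy_data():
    """Invented data (NOT the paper's dataset): (fever, platelets, headache) -> label."""
    rows, labels = [], []
    for fever in ("high", "normal"):
        for platelets in ("low", "normal"):
            for headache in ("yes", "no"):
                for _ in range(5):
                    rows.append((fever, platelets, headache))
                    is_dengue = fever == "high" and platelets == "low"
                    labels.append("dengue" if is_dengue else "no_dengue")
    return rows, labels


if __name__ == "__main__":
    rows, labels = make_toy_data()
    accuracy, mae = cross_validate(rows, labels, k=10)
    print(f"accuracy={accuracy:.3f}  mean absolute error={mae:.3f}")
```

Each row is held out exactly once, so the reported accuracy is computed over all instances, which is how a figure such as "99 correctly classified instances" would arise when every held-out prediction is correct.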

Similar resources

A Data Mining Approach for Precise Diagnosis of Dengue Fever

Dengue is an eviscerating disease common in tropical countries. It is also known as break-bone fever. The dengue dataset gives information about patients suffering from the disease. The dataset consists of attributes such as fever, bleeding, metallic taste, and fatigue. The main objective of this study is to calculate the performance of various classification techniques and compare their performa...

Dengue Fever Prediction : A Data Mining Problem

Dengue is a threatening disease caused by female mosquitoes. It is typically found in hot regions. For a long time, experts have been trying to identify features of dengue so that they can correctly categorize patients, because different patients require different types of treatment. Pakistan has been a target of dengue for the last few years. Dengue fever is used ...

Data Mining in Educational System using WEKA

Data mining, the extraction of hidden predictive information from large databases, is a powerful new technology with great potential, used in various commercial applications including retail sales, e-commerce, remote sensing, and bioinformatics. Education is an essential element for the progress of a country. Mining in an educational environment is called Educational Data Mining. Educational data min...

Prediction of Depression among Senior Citizens using Machine Learning Classifiers

Depression among the elderly population is an emerging public health problem. Various sociodemographic factors such as age, sex, earning status, living spouse, and family type are responsible for depression among senior people. Some comorbid conditions such as visual problems, hearing difficulties, and mobility problems also influence the disease. But depression can be diagnosed at earliest using predi...

Comparing the Performance of Data Mining Tools: WEKA and DTREG

The objective of the paper is to compare two data mining tools on the basis of various estimation criteria. The tools evaluated are WEKA and DTREG. They are used to build a multilayer perceptron, a data mining model, to predict the survivability of oral cancer patients. The oral cancer database is considered as it is estimated to be 8th most common cancer world...

Journal title:
  • CoRR

Volume abs/1502.05167  Issue 

Pages  -

Publication date 2015